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1. Introduction 

In [1] Yager provides an example in which the flat representation 
leads to unsatisfactory results. Consider a rule base consisting to two rules 
ifU is 12 the Vis 29 I. 


[2] of fuzzy if-then rules 


IfU is [10- 15] the Vis [25-30] II. , , 

If U = 12 we would get VisG where G = [25 - 30]. The application of the defuzzification process 
leads to a selection of V = 27.5. Thus we see that the very specific instruction was not followed. 

The problem with the technique used is that the most specific information was swamped by 
the less specific information. In this paper we shall provide for a new structure for the 
representation of fuzzy if-then rules. The representational form introduced here is called a 
Hierarchical Prioritized Structure (HPS) representation. Most importantly in addition to overcoming 
the problem illustrated in the previous example this HPS representation has an inherent capability to 
emulate the learning of general rules and provides a reasonable accurate cognitive mapping of how 
human beings store information. 


2. Hierarchical Prioritized Structure 

Figure 1, shows in a systematic view the of representation of the function V = t(U) by this 
new HPS representation. The overall function f, relating the input U to the output V , is comprised 
of the whole collection of subboxes, denoted fj. Each subbox is a collection of rules relating the 
system input, U, and the current iteration of the output, Vj_i, to a new iteration of the output. The 
output of the n^ subsystem, V n , becomes the overall output of the system, V. In the HPS the 
higher priority boxes, for i < j we say thatfi has a higher priority thanfj, would have less general 
information, consist of rules with more specific antecedents then those of lower priority. As we 
envision this system working an input value for U is provided, if it matches one or more of the rules 
in the first (highest priority) level then it doesn't bother to fire any of the less specific rules in the 


lower priority levels. 



Figure 1 Hierarchical Prioritized Structure 

In the following we describe the formal operation of this HPS. As we indicated Vj denotes 
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the output of the j^ 1 level. We shall assume Vq = O. In the HPS we shall use the variable Vj to 

indicate the maximum membership grade associated with the output of the level, Vj. 

In the HPS each fj (accept for the lowest level, j = n) is a collection of nj rules 

When U is Aj j is certain and Vj_i is low then Vj is Bji I 

The representation and aggregation of rules at each level is of the standard Mamdani type[2], 
disjunction of the individual rules. If Bj is the value obtained from the aggregation of the outputs of 
the collection of individual rules in I then the output of this subbox is 

Vj = V j _ 1 UBj. 

✓s /\ 

In I a rule fires if we are certain that the input U lies in Ajj and Vj_i is low. Since Vj_i is 
the maximum membership grade of Vj_] it can be seen as a measure of how much matching we 
have up to this point. Essentially this term is saying that if the higher priority rules are relevant, 
V;_i is not low, then don't bother using this information. On the other hand if the higher priority 

rules are not relevant, not to much matching Vj_i is low, then try using this information. 

The representation of the box fn is a collection of rules 

Xv 

When U is A nl and V n _i is low then V is B n i II 

plus the rule V = V n = V n _| LJ B n . The notable difference between the lowest priority box and the 

other ones is that the antecedent regarding U is certainly quality in the higher boxes. The need for 
this becomes apparent when the input is not a singleton. 

In the HPS structure Vj_i is the highest membership grade in Vj_j and as such the term 

Vj i is low is used to measure the degree to which the higher prioritized information have matched 
the input data. We note that low is a fuzzy subset on the unit interval. One definition for low [1] is 

low (x) = 1 - x. 

In [1] Yager looks at the formal operation of this kind of HPS we shall present the results 
obtained in [1]. We shall denote 3jj as the degree of firing (or relevancy) of Ay under the input, if 

the input is U = x* then 3y = Ay(x*). We shall denote gj = Max y Gj (y) = Poss[Gj]. We let 
ni 

Tj = u 3jj a Bjj T the aggregation of the rules in the ith level for input U, it is essentially the 
j=l 

contribution of the ith subsystem using the Mamdani type reasoning.. We shall let Gj be the output 
of the ith subbox, that is Vj = Gj. In [1] it is shown that 

Gj(y) = (Tj(y) a (1 - gi _i)) v Gj_i(y). Ill 

We notice that the term (1 - gj_i) bounds the allowable contribution of the i 1 * 1 subsystem to 

the overall output. We see that as we get at least one element y to be, a good answer (an element in 
Gj_i) we limit the contribution of the lower priority subsystems. It is this characteristic of a kind of 

saturation along with the prioritization that allows us to avoid the problem described earlier. 

In the following we suggest a modification of the above that leads to a more suitable 

formulation to the aggregation between the levels of the HPS [1]. We can replace a by another t- 
norm operator product * and replace v by another t-conorm, bounded sum, a [+] b = Min(l, 
a+b)[3]. Thus we get 

Gi(y) = Tj(y) * (1 - gj_i) Q Gj_i(y). 

However we note that since gj_i = Max y Gj_] then Tj(y) * (1 - gi_i ) ^ Gj_j(y) hence 
Tj(y) * (1 - gj_i) + Gj_i(y) < 1 thus we can replace [+] by +. This gives us the formulation 
Gi(y) = Tj(y) * (1 - gi_i) + Gj_i (y) (IV) 

Gj(y) = Tj(y) * (1 - Poss[Gj_i]) + Gj_i(y) 

What is happening in this structure is that as long as we have not found one y with 
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membership grade 1 in Gj_i, Poss[Gi_il * 1, we add some of ihe output of the current subbox to 
what we already have. Each element y, gets 1 - Poss[G W ] portion of the contnbuoon at that level, 

rp # / \ 

1 ’ We should point out that the aggregation performed in the hierarchical structure, whether we 
use III or IV is not a pointwise operation. This means that the value of G,(y) doesn t only depe 
on the membership grade of y in G|_] and Tj but on membership grades at other points In 
particular through the term gj_i = 1 - Max y G|(y) it depends upon the membership grade of all 

elements from Y in Gi-1. . . , „ 

We should note that implicit in this structure is a new kind of aggregation. Assume A and B 

are two fuzzy subsets we define the combination of these sets as the fuzzy subset D, denoted 

D = y(A, B) where 

D(x) = (1 - Poss(A)) * B(x) + A(x). 

3 ReDresentation and Operation of the HPS . , , 

In the previous part we have described the formal mechanism used for the reasoning and 
aggregation process in the HPS. While the formal properties of the new aggregation stnicture are 
important a key to the usefulness of the HPS in fuzzy modeling is the semantics used in the 

reoresentation of the information via this structure. , 

p In constructing an HPS representation to model a system we envision that the knowledge of 
the relationsMpcontamed in the HPS structure be stored in the following manner. At the highest 
level of priority, i = 1, we would have the most specific precise knowledge. In particular we would 

have point to point relationships, 

When U is 3 then V is 7 
When U is 9 then V is 13 

This would be information we know with the greatest certainty. . . 

At the next level of priority the specificity of the antecedent linguistic variables, the 
A 2 ;’s, would decrease. Thus the second level would contain slightly more general knowledge. 

J ' Essentially what we envision is that at the highest level we have specific point information. 
The next level encompass these points and in addition provides a more general and perhaps fuzzy 
knowledge. We note that the lowest most level can be used to tell us what to do if we have no 
knowledge up to this point. In some sense the lowest level is a default value. 

Example: Assume we are using an HPS representation to model a function V = f(U), where the 
base set for U is [0, 100]. A typical HPS representation could be as follows. 

LEVEL #1 

Rj j When U is 5 then V is 13 

R |2 When U is 75 then V is 180 

Rjj When U is 85 then V is 100 

LEVEL #2 

r 21 When U is "about 10" then V is "about 20" 

r 22 When U is "about 30" then V is "about 50" 

r 23 When U is "about 60" then V is "about 90" 

r 24 When U is "about 80" then V is "about 120" 

r 25 When U is "about 100" then V is "about 150" 

(we assume triangular fuzzy subsets) 

LEVEL #3 

r 31 When U is "low then V is "about 40" 

r 32 When U is ":meet" then V is "about 85" 

r 31 When U is "high" then V is "about 130" 

LEVEL #4 

r 4 « U is anything the V is 2u. 
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Having defined our knowledge base we now look at the performance of this system for various 
inputs; 

Case 1 : U = 75. At level one we get Tj = j hence since 

Gi(y) = go* T i(y) + G o(y)- 

Since Gq = then gj = 1 which give us Gj = Tj = ( ygg j- We now see that gj = 0 and hence no 

other rules will fire lower in the hierarchy. This system provides as its output for U = 75 that 

Vis 180. 

Case 2: U = 80. At level no rules fire, 3jj = 0 for all j. Thus Tj = <I> hence 

G l =gO* T l +Go = 0 
and therefore gj = 1. At level two 

g 2 = gl * T2 + 0 = T2- 

For U = 80 we assume that R 24 fires completely, 324 = 1 and that all other rules don't fire, 

$ 2 j = 0, for j * 2. Thus T 2 = "about 120" and G 2 = "about 120”. Since g 2 = 1 then g 2 = 0 and 
no rules at lower priority will fire thus G 2 , "about 120 ", is the output of the system for U = 80. 

Case 3: U = 20. No rule at level one will fire, hence Gj = Go = <D. At level two we shall assume 
that R 21 fires to degree .3 and R 22 also fires to degree .3. Thus 
T 2 = .3 a Bj u .3 a B 2 = .3 a (Bi u B 2 ) 

T 2 (y)=.3A(B 1 (y)vB 2 (y)). 

We note Bj and B 2 are "about 20" and "about 30" respectively. Hence 

G 2 (y) = (1 - gl) * T 2 (y) + Gi(y) = T 2 (y) 

At level three R 31 fires to degree 1 while R 31 and R 3 2 don't fire at all. Hence 

T 3 = "about 40" 

Since Max [G 2 ] = .3 thus 1 — g 2 = -7 and therefore 

G 3 (y) = .7 * T 3 (y) + G 2 (y) 

Since Max T 3 (y) = 1 we see that the process stops here and G 3 is the output of the system. 

What we see with this HPS representation is that we have our most general rule stored at the 
lowest level of priority and we store exceptions to this rule at higher levels of priority. In some 
cases the exceptions to general rules may themselves be rules, we would then store exceptions to 
these rules at still higher levels of priority. As the previous example illustrates in the HPS system 
for a given input we first look to see if the input is an exception, that is what we are essentially 
doing by looking at the high priority levels. 

4.Learning in the HPS 

The HPS representation is a formulation that has an inherent structure for a natural human 
like learning mechanism. We shall briefly describe the type of learning that is associated with this 
structure. 

Information comes into the system in terms of point by point knowledge, data pairs between 
input and output. We store these points at the highest level of priority. Each input/output pair 
corresponds to a rule at the highest level. If enough of these points cluster in a neighborhood in the 
input/output space we can replace these points by a general rule (see figure 2 ). 

Thus from the dots, input/output pairs, we get a relationship that says ifU is in A then V is 
B. We can now forget about the dots and only save the new relationship. We save this at the next 
lowest priority in the system, in subbox 2 . 

We note that the introduction of the rule essentially extends the information contained in the 
dots by now providing information about spaces between the dots. We can also save storage 
because we have eliminated many dots and replaced them by one circle. One downside to this 
formulation is that in generalizing we have lost some of the specificity carried by the dots. 


52 



♦ *« • 



u u 
Figure 2 Formulation of Rules for Input/Output Pairs 

It may occur that there are some notable exceptions to this new general rule. We are able to 
capture this exception by storing them as high level points. 

We further note that new information enters the system in terms of points. Thus we see that 
the points are either new information or exceptions to more general rules. Thus specific information 
enters as points it filters its way up the system in rules. 

We see that next that it may be possible for a group of these second level rules to be 

clustered to form new rules at the thiid level. 

In figure #3 the large bold circle is seen as a rule which encompasses the higher level rules 
to provide a more general rule. The necessity to keep these more specific rules, thus in level 2, 
depends upon how good the less specific rule captures the situation. 
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Figure 3 Aggregation of Rules into More General Rules 
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